Jiangyan Yi

Edit Content, Preserve Acoustics: Imperceptible Text-Based Speech Editing via Self-Consistency Rewards

Jan 31, 2026

OV-InstructTTS: Towards Open-Vocabulary Instruct Text-to-Speech

Jan 04, 2026

$\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection

May 16, 2025

Region-Based Optimization in Continual Learning for Audio Deepfake Detection

Dec 16, 2024

Reject Threshold Adaptation for Open-Set Model Attribution of Deepfake Audio

Dec 02, 2024

Unification of Balti and trans-border sister dialects in the essence of LLMs and AI Technology

Nov 20, 2024

From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language

Nov 20, 2024

WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification

Sep 18, 2024

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Aug 11, 2024

ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild

Aug 09, 2024